Toward Guidelines for Modeling Learning Agents in Multiagent-Based Simulation: Implications from Q-Learning and Sarsa Agents
نویسندگان
چکیده
This paper focuses on how simulation results are sensitive to agent modeling in multiagent-based simulation (MABS) and investigates such sensitivity by comparing results where agents have different learning mechanisms, i.e., Q-learning and Sarsa, in the context of reinforcement learning. Through an analysis of simulation results in a bargaining game as one of the canonical examples in game theory, the following implications have been revealed: (1) even a slight difference has an essential influence on simulation results; (2) testing in static and dynamic environments highlights the different tendency of results; and (3) three stages in both Q-learning and Sarsa agents (i.e., (a) competition; (b) cooperation; and (c) learning impossible) are found in the dynamic environment, while no stage is found in the static environment. From these three implications, the following very rough guidelines for modeling agents can be derived: (1) cross-element validation for specifying key factors that affect simulation results; (2) a comparison of results between the static and dynamic environments for determining candidates to be investigated in detail; and (3) sensitive analysis for specifying applicable range for learning agents.
منابع مشابه
Lessons Learned from Comparison Between Q-learning and Sarsa Agents in Bargaining Game
This paper focuses on sensitivity of learning mechanisms applied to agents in agent-based simulation and explores criteria for employing such learning mechanisms by comparing simulation results derived from agents who have different learning mechanisms. Specifically, we employ two types of reinforcement learning in this study, Q-learning and Sarsa. Through an analysis of simulation results in a...
متن کاملA Multiagent Reinforcement Learning algorithm to solve the Community Detection Problem
Community detection is a challenging optimization problem that consists of searching for communities that belong to a network under the assumption that the nodes of the same community share properties that enable the detection of new characteristics or functional relationships in the network. Although there are many algorithms developed for community detection, most of them are unsuitable when ...
متن کاملVoltage Coordination of FACTS Devices in Power Systems Using RL-Based Multi-Agent Systems
This paper describes how multi-agent system technology can be used as the underpinning platform for voltage control in power systems. In this study, some FACTS (flexible AC transmission systems) devices are properly designed to coordinate their decisions and actions in order to provide a coordinated secondary voltage control mechanism based on multi-agent theory. Each device here is modeled as ...
متن کاملA Reinforcement Learning Approach for Multiagent Navigation
This paper presents a Q-Learning-based multiagent system oriented to provide navigation skills to simulation agents in virtual environments. We focus on learning local navigation behaviours from the interactions with other agents and the environment. We adopt an environment-independent state space representation to provide the required scalability of such kind of systems. In this way, we evalua...
متن کاملDynamic Pricing Agents and Multiagent Learning
Abstract. We implemented three different types of pricing agents in a simulated economy. Each type of agent is based on a different learning method. The first method is simple reinforcement learning. The second method is the traditional Q-learning method. The third method is Nash Q-learning method. In each simulation, there are two agents, and a fixed amount of customers. The agent that charges...
متن کامل